Contour vs Non-Contour based Word Segmentation from Handwritten Text Lines- an Experimental Analysis
نویسندگان
چکیده
This paper compares contour based and noncontours based techniques for extracting words from unconstrained handwritten text lines. Proposed novel approach is based on contours of the words rather only considering threshold for inter-word gaps as previous studies. In this approach, contour of each word is examined along with threshold for inter-word gaps to extract words with high confidence. Unlike previous studies, preprocessing technique is not applied, that enhance the speed significantly. Furthermore, a simple technique for punctuation detection is proposed to increase accuracy of word extraction. For fair comparison text lines are taken randomly from IAM benchmark database and threshold calculation is kept same for all techniques. Experiments thus performed, exhibit improved results and speed over the conventional word extraction methods. Furthermore, developed techniques and results are compared with the other approaches available in the literature using same benchmark database.
منابع مشابه
Statistical Approach for Segmenting Unconstrained Handwritten Text lines
The segmentation of unconstrained handwritten text lines into words is an important stage in word recognition systems. This paper addresses a methodology to overcome the challenges, which are amplified by the non-uniform spaces between words and overlapping components by using a few statistical approaches. The system was developed using Java 2 and ImageJ tool. In this approach, a text line imag...
متن کاملA Modified Character Segmentation Algorithm for Farsi Printed Text Using Upper Contour Labelling
In this paper, a modified segmentation algorithm for printed Farsi words is presented. This algorithm is based on a previous work by Azmi that uses the conditional labeling of the upper contour to find the segmentation points. The main objective is to improve the segmentation results for low quality prints. To achieve this, various modifications on local baseline detection, contour labeling an...
متن کاملA Modified Character Segmentation Algorithm for Farsi Printed Text Using Upper Contour Labelling
In this paper, a modified segmentation algorithm for printed Farsi words is presented. This algorithm is based on a previous work by Azmi that uses the conditional labeling of the upper contour to find the segmentation points. The main objective is to improve the segmentation results for low quality prints. To achieve this, various modifications on local baseline detection, contour labeling an...
متن کاملRegion growing based segmentation algorithm for typewritten and handwritten text recognition
This paper presents a new technique of high accuracy to recognize both typewritten and handwritten English and Arabic texts without thinning. After segmenting the text into lines (horizontal segmentation) and the lines into words, it separates the word into its letters. Separating a text line (row) into words and a word into letters is performed by using the region growing technique (implicit s...
متن کاملCharacters Segmentation of Cursive Handwritten Words based on Contour Analysis and Neural Network Validation
This paper presents a robust algorithm to identify the letter boundaries in images of unconstrained handwritten word. The proposed algorithm is based on vertical contour analysis. Proposed algorithm is performed to generate presegmentation by analyzing the vertical contours from right to left. The unwanted segmentation points are reduced using neural network validation to improve accuracy of se...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- JDCTA
دوره 3 شماره
صفحات -
تاریخ انتشار 2009